A Game Theoretic Approach 
to Quantum Information 



Xianhua Dai and V. P. Belavkin 
School of Mathematical Sciences 
University of Nottingham, UK 

Abstract 

This work is an application of game theory to quantum informa- 
tion. In a state estimate, we are given observations distributed accord- 
ing to an unknown distribution Pg (associated with award Q), which 
Nature chooses at random from the set {Pq : 9 £ Q} according to a 
known prior distribution /x on @, we produce an estimate M for the 
unknown distribution Pg, and in the end, we will suffer a relative en- 
tropy cost TZ{P; M), measuring the quality of this estimate, therefore 
the whole utility is taken as P ■ Q — TZ{P; M). 

In an introduction to strategic game, a sufficient condition for min- 
imax theorem is obtained; An estimate is explored in the frame of 
game theory, and in the view of convex conjugate, we reach one new 
approach to quantum relative entropy, correspondingly quantum mu- 
tual entropy, and quantum channel capacity, which are more general, 
in the sense, without Radon-Nikodym (RN) derivatives. Also the 
monotonicity of quantum relative entropy and the additivity of quan- 
tum channel capacity are investigated. 

1 Introduction 

Much of quantum information has been concerned with the estimate. 
We are given observations distributed according to an unknown distri- 
bution Pg (associated with award Q), which Nature chooses at random 
from the set {Pg : 6 £ 0} according to a known prior distribution fi 
on O, we produce an estimate M for the unknown distribution Pg. 
In the end, we will suffer a cost, measuring the quality of this esti- 
mate, therefore the whole utility is in terms of award and cost. One 
such cost function is relative entropy function TZ{P; M), important in 
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several fields, such as information theory, data compression, compu- 
tational learning theory, game theory, statistics, statistical mechanics, 
and econometrics. 

In the source coding interpretation of the estimate, the minimax 
value of this game can be interpreted as the capacity of the channel 
from @ to X; In computational learning theory, the minimax value of 
this game is the utility shared by an adaptive algorithm, predicting 
each observation before it arrives on the previous observation, com- 
pared to an algorithm predicting after knowing the real distribution; In 
gambling theory and mathematical finance, the relative entropy mea- 
sures the expected reduction in the logarithm of compounded wealth 
due to lack of knowledge of the true distribution, thus the minimax 
value of this game is the practical compounded wealth. 

In this paper, an introduction to strategic game will be briefly 
given, during which a sufficient condition for minimax theorem is ob- 
tained; an estimate is explored in the frame of game theory, and in 
the view of convex conjugate, we reach one new approach to quantum 
relative entropy, quantum mutual entropy, and quantum channel ca- 
pacity, which are more general, in the sense, without Radon-Nikodym 
(RN) derivatives. Also the monotonicity of quantum relative entropy 
and the additivity of quantum channel capacity will be discussed. 

The structure of the paper is organized as follows: In the second 
section, wc will give a brief introduction to strategic game, during 
which a sufficient condition for minimax theorem is obtained; In the 
third section, we will introduce convex conjugate along with some 
examples, mainly on its important mathematical properties for our 
application; In the fourth section, we introduce an estimate in the 
frame of game theory with the cost of classical relative entropy and 
reach one new approach to classical relative entropy in the view of 
convex conjugate; Similarly, we introduce this approach to quantum 
relative entropy, in the section five, especially the monotonicity of 
quantum relative entropy is discussed, and further one approach to 
quantum mutual entropy will be given in the sixth section; The section 
seven is for quantum channel capacity and its additivity; Final section 
is for conclusion and possible further problems. 

2 Strategic Game 

Extending the simpler optimization approach developed in variation 
methods and mathematical programming in mathematics, optimiza- 
tion theory and algorithms in information and computer science, and 
operation research in neoclassical economics, game theory studies sit- 
uations where multiple players make decisions in an attempt to max- 
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imize their returns. The essential feature is to provide a formal mod- 
eling approach to social or informational situations in which decision 
makers interact with other agents. 

Some game theoretic analysis appear similar to decision theory, 
but game theory studies decisions made in an environment in which 
players interact. Alternatively, game theory studies choice of optimal 
behavior when costs and benefits of each option depend upon the 
choices of other individuals. In this sense, game theory, much more 
than decision theory, is of similar spirit and situation to information 
and computer science. 

Since Von Neumann and and O. Morgenstern's classic Theory of 
Games and Economic Behavior [NM 1944] in 1944, there are many 
introductions to game theory, such as [OR 1994]. In this section, we 
briefly introduce strategic game, mainly general definitions, existence 
theorems, and competitive game. 

2.1 General Definitions 

In game theoretic models, the basic entity is a player. A player may be 
interpreted as an individual or as a group of individuals making a de- 
cision. Once we define the set of players, we may distinguish between 
two types of models: those in which the sets of possible strategies of 
individual players are primitive; those in which the sets of possible 
joint strategies of groups of players are primitive. Models of the first 
type are referred to as " noncooperative" . 

A strategic game is a model of interactive decision-making in which 
each decision-maker chooses his plan of strategy once for all, and that 
these choices are made simultaneously. The model consists of a finite 
set N of players and, for each player i, a set of Ai of strategies and a 
utility function on the set of strategy profiles Ai x . .. x A^. 

Definition 1 ([Borel 1953], [Neumann 1928]) A non-cooperative 
finite strategic game consists of 

• a finite set N ( the set of players) 
and for each player i ^ N, 

• a set Ai (the set of strategies available to player i on strategy 
profile) 

• a payoff function Ui : A ^ R, where A = XjL-^^Aj (the payoff of 
player i). 

Denote by Sj the set of probability measures over Ai, which are 
player z's mixed strategies. And denote by the suffix —i all players 
except i. 
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In the play of a strategic game, each player holds the correct ex- 
pectation about the other players' behavior and acts rationally, thus 
a steady state is reached. If not attempting to examine the process 
by which a steady state is reached, we call it Nash equilibrium. 

Definition 2 ([Nash 1950]) A mixed Nash equilibrium of a finite 
strategic game {N, (Ai), (uj)) is a vector (tti, 7r2, . . . , 7r„), with tt^ G 
for all i e N, such that 

^aeAT^i{ai)TT-i{a-i)ui{ai, a-i) > Y.a£APi{ai)TT-iia-i)ui{ai, a-i) (1) 

for all Pi G Sj and for all i £ N. 

Therefore, pure-strategy Nash equilibria are those which only in- 
volve degenerate mixed strategies. 

The following restatement of the definition is useful elsewhere. 

Definition For any a_j G A^i, we define Bi{a-i) best actions to 
be the set of player i's given a^f. 

Bi{a-i) = {ai e Ai : Ui{ai,a-i) > Ui{ai,a-i)}. (2) 

The set-valued function Bi is called the best-response function of 
player i. Therefore, a Nash equlibrium is a profile a* of actions for 
which 

e BiiaU), (3) 

for all i G A^. 

This alternative definition formulation points us to a (not neces- 
sarily efficient) method of finding Nash equilibria: At first to calculate 
the best response function of each player, then to find a profile a* of 
actions for which a* G Bi{a*_j) for all i G A". Obviously, if the function 
Bi are singleton- valued, the second step deduces to solve | A^| equations 
in the \N\ unknowns {a*)i^N- 

2.2 Existence Theorems 

An existence result has two purposes: At first, if wc have a game 
satisfying the hypothesis of the result, it is hopeful to find an equi- 
librium; Secondly, the existence of an equilibrium ensures the game 
consistent with a steady state solution; Furthermore, the existence of 
an equilibria for a family of games allows us to study properties of 
these equilibria without finding them explicitly and without the risk 
to study the empty set. 

It is extensively investigated that under which conditions the set of 
Nash equilibria of a game is nonempty. We here just introduce one of 
the simplest existence result, whose mathematical level is much more 
advanced. 
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To prove that a Nash equihbrium exists for a game, it suffices to 
show that there is a profile a* of actions such that a* G Bi{a*_i) for 
all i G iV, which is a* G B{a*), if wc define the set-valued function 
B : A — > A by B(a) = Xi^pfBi{a^i). Luckily fixed point theorems 
give conditions on B under which there exists a value of a* for which 
a* G B{a*). Generally we apply the following fixed point theorem. 

Theorem ([Kakutani 1941]) Let X be a compact convex subset 
of and let / : X — > X be a set-valued function for which 

• for all X G X, the set f{x) is nonempty and convex; and 

• the graph of / is closed (i.e. for all sequences and 
such that {un} G fi{xn}) for all n, Xn — > x and yn — > y, we 
have y G /({x})). 

Then there exists a x* e f{x*). 

Theorem ([Nash 1950, 1951]) A Nash equilibrium of strategic 
game {N, (Aj), (uj)) exists if for all i e N, 

• the set Ai of actions of player i is a nonempty compact convex 

subset of a Euclidian space; and 

• the utility function Ui is continuous and quasi-concave on Ai. 

Proof: Let set-valued function Bi the best-response function of 
player i, we define B : A — > A by B{a) = Xi^j\[Bi{a-i). 

For every i e N, the set Bi{a-i) is nonempty since the utility 
function Ui is continuous and the set Ai is compact, and also is convex 
since the Titility function is quasi-concave on Ai] B has a closed 
graph since each utility function Ui is continuous. 

Following the Kakutani's fixed point theorem, B has a fixed point; 
any fixed point is a Nash equilibrium of the game as noted. 

This result states that a strategic game satisfying certain condi- 
tions has at least one Nash equilibrium; but a game may have more 
than one equilibrium. Note that this theorem does not apply to any 
game in which some player has finitely many actions, since the set 
of actions of every player is not convex, but a mixed strategy Nash 
equilibrium of every finite strategic game {N, {Ai), (ui)) always exists. 

Theorem ([Nash 1950]) A mixed strategy Nash equilibrium of 
every finite strategic game {N, (Ai), (ui)) always exists. 

Proof: Let G = {N, (Ai), (ui)) be a strategic game, and for each 
player i, let rrii be the number of members of the set Ai, then we 
identify the set A{Ai) of player i's mixed strategies with the set of 
vectors {pi,P2, ■■■,Pm,) for which pk>0 for all k and YJkLiPk = 1 {Pk 
being the probability with which player i uses his iih pure strategy). 
This set is nonempty, convex, and compact. Since expected payoff is 
linear in the probabilities, each player's payoff function in the mixed 
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extension of G is quasi-concave in his strategy and continuous. Thus 
a mixed strategy Nash equihbrium exists due to above theorem. 

2.3 Competitive Game 

Little obtained on the set of Nash equihbria of an arbitrary strate- 
gic game, we discuss strictly competitive games and its qualitative 
character of the equilibria. 

Definition A strategic game ({1, 2}, {Ai), (ui)) is strictly compet- 
itive if for any a E A and 6 G ^, we have ui{a) > ui{b) if and only if 
U2{a) < ui{b). 

Player i maxminimizes if he chooses an action best for him on the 
assumption that whatever he does, player j will choose her action to 
hurt him as much as possible. We will find that for a strictly comple- 
tive game possessing a Nash equilibrium, a pair of actions is a Nash 
equilibrium if and only of the action of each player is a maxminimizer, 
a striking result since providing a link between individual decision- 
making and the reasoning behind the notion of Nash equilibrium, dur- 
ing which we also find that for a strictly completive game possessing 
Nash equilibria yield the same payoffs. 

Definition Let G = ({1, 2}, (^j), (ttj)) be a strictly completive 
game, the action x* G is a maxminimizer for player 1 if for all 
X G Ai, 

m.mui{x*,y) > mm ui{x,y). (4) 
yeA2 yeA2 

Similarly, the action y* G ^2 is a maxminimizer for player 2 if for all 
y e A2, 

min U2{x,y*) > mm U2{x,y). (5) 

Theorem ([Neumann 1928]) Let G = ({1, 2}, (Ai), (u,)) be a strictly 
completive game. 

a) If (x* ,y*) is a Nash equilibrium of G, then x* is a maxminimizer 
for player 1 and y* is a maxminimizer for player 2. 

b) If {x*,y*) is a Nash equilibrium of G, then 

max min til (x, y) = minmaxtti(a;, y) = ui(x* ,y*), (6) 
X y y X 

and all Nash equilibria of G yield the same payoffs. 

c) If maxj; miuy ni(,T, y) = min^^ max^; y), x* is a maxmini- 
mizer for player 1, and y* is a maxminimizer for player 2, then (x*,y*) 
is a Nash equilibrium of G. 

Proof: First to prove (a) and (b). Let {x*,y*) is a Nash equilib- 
rium of G, then U2{x*,y*) > U2{x*,y) for all y G ^2, or ui{x*,y*) < 
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ui(x*,y) for all y £ A2. Hence 

ui{x*,y*) = min ui (x* , y) < maxmin'Ui(a;, y). (7) 
y X y 

Similarly, 

> maxminui(x,y). (8) 

X y 

Thus ui{x*,y*) = maxa; miiij^ y) and x* is a maxminimizer 
for player 1. Similar argument for player 2, y* is a maxminimizer for 
player 2 and ^2(0::*,^*) = maxj^ min^; «2(x, y). 

Now to prove (c). Let v* = max^^ minj, ui{x, y) = min^ max^^ ui{x^ y), 
for a strictly completive game, we have —v* = maxj, min^; tt2(x, y). 
Since x* is a maxminimizer for player 1, we have ui{x*,y) > v* for 
all y E A2; similarly, U2{x,y*) > —v* for all x & Ai. Taking y = y* 
and X = X* in those two inequalities, we have v* = ui{x* ,y*), again 
considering this strictly completive game, we reach that {x*,y*) is a 
Nash equilibrium of G. 

Following part (c), a Nash equilibrium can be found by solving the 
problem 

maxmintti(x,y); (9) 

X y 

following part (a) and (c), Nash equilibria of strictly completive game 
are interchangeable: if {x,y) and {x',y') are equilibria, so are {x,y') 
and (x',y); following (b), for any strictly competitive game with a 
Nash equilibrium, 

maxmintii(x, y) = minmaxiti(x, y) = ui{x*,y*). (10) 
X y y X 

If maxa; miuj^ y) = min^^ max^: ■Ui(x, y) = ui{x*,y*), we say 
this equilibrium payoff of player 1 is the value of the game. 

Theorem Let Ai, A2 be non-empty, convex and compact subsets 
of M" for some n. Let payoff u : Ai x A2 — > M be a continuous 
function, such that 

• Va2 G A2, the set {ai G Ai : ^(01,02) > ■u(a']^,a2), Vo'^ € ^1} is 
convex; and 

• Voi G Ai, the set {02 € A2 ■ u{ai,a2) < ^(01,02), G A2} is 
convex. 

Then there exists an a* G Ai x ^2, such that 

max mm u{ai,a2) = u{a*) = mm max 11(01,02). (11) 

Proof: At first, continuous payoff function u is quasi-concave with 
respect to two arguments, since Va2 G A2, the set {ai G Ai : u{ai, 02) > 
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u(a'i, 02), Va'^ G ^i} is convex, and Vai G Ai, the set {02 G ^2 : 
u(ai,a2) < ^(ai, 02), € A2} is convex. 

Following [Nash 1950, 1951], a Nash equilibrium a* e Ai x A2 of 
this strategic game exists. 

Further according to [Neumann 1928], for this competitive game, 

maxmin'u(x, y) = min max u{x,y) = u{a*). (12) 
X y y X 

3 Convex Conjugate 

In mathematics, convex conjugation, as a generalization of the Legen- 
dre transformation (in which sense, is also taken as Lcgcndrc-Fcnchel 
transformation or Fenchel transformation elsewhere), addressed much 
attention in the study of extremum problems, among which are sys- 
tem inequalities, the minimum or maximum of a convex function over 
a convex set, Lagrange multipliers, and minimax theorems. 

There are excellent books on the introduction to convex analysis, 
such as [Rockafellar 1970] for pure mathematics, [Arnold 1989] for 
application in theoretic mechanics. In this section, we simply overview 
convex conjugation, first on its definition including some examples, 
then on some properties for our application. 

3.1 General Definition 

Definition Let X be a linear normed space, and X* the dual space 
to X, we denote the dual pairing by 

{.,.): X*xX — >R. (13) 

Given a function / : X — > M. U {+00} taking values on the extended 
real number line, we define the convex conjugate /* : X* — >■ R U 
{+00} by 

fix*) = sup{{x*,x) - fix)\x G X}, (14) 
or, equivalently, by 

r{x*) = -inf{/(x) - {x*,x)\x e X}. (15) 

We consider convex conjugates for some examples, via simple com- 
putations, following the above definition. 

Example 1 An affine function is generally written by 

f{x) = {a,x) -b,aeW,beR. (16) 
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Then its convex conjugate f*(x*), denoted by Oa{x*), is 

r{x*) = Oa{x*) = I " ''*rf 

I OO X a. 
Example 2 The norm function is generahy written by 

fix) = \\x\\. (17) 
Then its convex conjugate f*{x*), denoted by Oi{x*), is 



r{x*) = o,{x*) 



a \\x*\\ < 1; 
oo ||x*|| = 1. 



Example 3 Let K C X he a convex subset and e{x) be the cali- 
bration function 

e{x) = sup{{x*,x)\x* eK}. (18) 

Then its convex conjugate e*{x*) is Ok{x*), where Ok{x*) is de- 
fined as follows. 



e*ix*)^OKix*) 



X* eK; 
oo X* e K'^, 



where is the complement of K. 

Example 4 The convex conjugate f*{x*) of exponential function 
f{x) = is 

{x* Inx* - X* x* > 0; 
x* = 0; 
oo X* <0. 

Let a cone X+ be {a; > 0} C X, and X^ = {x* G X* : {x*,x) > 0} its 
dual cone. Then the convex conjugate f*{x*) of exponential function 
f{x) = on X+ is 



fix* 



X* Inx* - X* X* > 0; 
x* = 0. 



3.2 Some Properties 

Theorem The conjugate function of a closed convex function is a 

closed convex function. 

Proof: For every t G MP|[0, 1], and every x*,y* G X*, according to 
the definition of convex conjugate, 

ntx* + (1 - t)y*) = sup{{tx* + (1 - t)y*,x) - f{x)\x G X} (19) 
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= sup{{{tx*,x) - tf{x)) + (((1 - t)y*,x) - (1 - t)f{x))\x G X] (20) 
< tsvip{{x\x)-f{x)\x G X} + (l-t)sup{(y*,x)-/(x)|x G X} (21) 
= tr{x*) + {l-t)ny*). (22) 

Theorem (Order-reversing) Convex-conjugation is order-reversing, 
i.e., ii f < g, then /* > g*, where f < g means for every x £ X, 
fix) < g{x). 

Proof: Since f < g, then for every x e X, 

fix) < g{x). (23) 

According to the definition of convex conjugate, for every x* G X*, 

r{x*) = sup{{x*,x)-f{x)\xeX} (24) 

> sup{(a;*, a;) - g{x)\x G X} (25) 
= g*{x*). (26) 

Thus /* > g*, since every x* e X*. 

Theorem (Biconjugate) The convex conjugate of a function is 
lower semi-continuous. The biconjugate /** (the convex conjugate of 
the convex conjugate) is the closed convex hull, that is, the largest 
lower semi-continuous convex function smaller than /. Furthermore, 
for proper functions /, / = /** if and only if / is convex and lower 
semi-continuous. 

Proof: For every x* < Xq E X* , 

r{x*) = sup{{x*,x)-f{x)\xeX} (27) 
<suv{{x*o,x)-f{x)\xeX} (28) 

^r(^S), (29) 

which implies that the convex conjugate of a function is lower semi- 
continuous. 

Theorem (Fenchel's inequality or Fenchel- Young inequality) For 
any proper convex function / and its convex conjugate /*, Fenchel's 
inequality holds: 

{p,x)<f{x) + np), (30) 

for every x G X,p G X*. 

Proof: According to the definition of convex conjugate, for every 
X G X,p G X*, 

fix) + f*{p) = fix) + snp{{p,x) - fix)\x G X} (31) 

>f{x) + {{p,x)-f{x)) (32) 
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= {p,x). (33) 

Theorem (Infimal convolution) Let /i,...,/^ be proper convex 
functions on X. Then 

(/l *inf - *inf fruT = fl + ■■■ + tm. (34) 

where the infimal convolution of two functions / and on X is defined 
as 

{f^inig){x) ^ inf{/(.T - y)+9{y)\y G X}. (35) 
Proof: Here we just consider the case for m = 2, for x G X 

(/*inf 5)*(^*) = (}^i{f{x-y)+g{y)\y G Xfix*) (36) 

= sup{(x*,x) - inf{/(x -y)+ g{y)\y G X}\x G X} (37) 

= sup{(a;*, {x-y)+y) - f{x - y) - g{y)\x, y e X] (38) 

= sup{(x*,x - y) - f{x - y)\x,y G X} + sup{(x*,y) - g{y)\y G X} 

(39) 

= r{x*)+g*{x*y, (40) 
Since the infimal convolution is associative, i.e., 

[(/l *inf ••• ^inf /m-l) *inf fm]* = (/l *inf ••• *inf fm-l)* + fm^ (41) 

the theorem follows from mathematical induction for general m. 

4 Classical information 

In classical information theory, we need find fundamental limits on 
compressing and reliably communicating classical data. A key mea- 
sure of information is known as information entropy, usually expressed 
by the average number of bits needed for storage or communication. 

This section introduces basic classical information quantities, say 
Shannon entropy, relative entropy, see, for example, [Shannon 1948], 
for reference. Wc first overview basic mathematical forms of Shannon 
entropy and relative entropy, then explore classical estimation in the 
frame of game theory, and reach a new approach to relative entropy 
in the view of convex conjugate. 
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4.1 Classical Relative Entropy 



Suppose there is a random variable with true distribution F (for the 
density /). Then we could represent that random variable with a code 
of average length H{F), where Shannon entropy H{F) is expressed in 
mathematics as follows. 

Definition (Shannon Entropy H{F) of the distribution F) 



However, due to incomplete information (wc do not know F re- 
ally), we assume G the distribution of the random variables instead. 
Then the code would need more bits to represent the random variable. 
The difference, in the number of bits, denoted by TZ{F;G), between 
a "true" probability distribution F and an arbitrary probability dis- 
tribution G is known as the relative entropy [Shannon 1948], or the 
KullbackLeibler divergence, information divergence, information gain 
in probability theory and information theory, which is expressed in 
mathematics as follows. 



Definition (Relative Entropy TZ(F; G) of probability distributions 
F and G) 



where / and g are the respective densities with respect to any domi- 
nating measure. 

Though the relative entropy TZ{F; G) is often taken as a distance 
metric, but it is not a true metric, since it is not symmetric between 
distribution F and G. 

There may be some interpretations for relative entropy, for ex- 
ample, the relative entropy 7^(-F; G) may be interpreted as the error 
exponent for the hypothesis test F versus G. 

4.2 Classical Estimate 

In a classical estimate, we are given classical observations distributed 
according to an unknown distribution Pq ^ X ^ I associated with 
award Q, which Nature chooses randomly from the set {Pg : G 0} 
according to a known prior distribution /i on 0, we produce an esti- 
mate M for the unknown distribution Pq, denoted by P later without 
notation confusion. In the end, we will suffer a relative entropy cost 
Tl{P; M), measuring the quality of this estimate, thus the whole utility 



In this section, we will investigate the existence of minimax value 
of this utility, correspondingly its minimax strategy. 




(42) 




(43) 



is P-Q-n{P;M). 
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We consider the utility P-Q—TZ{P; M), then the estimate problem 
is in fact the following optimization problem 

min max \P ■ Q - n(P; M)]. (44) 

M>0,M-I=1P>0,P-I=1 

Considering the convex conjugation Tl\^{Q) of Tl{P;M) with re- 
spect to P, that is, 

we can rewrite the above estimate problem as follows. 

min max \P ■ Q - n(P- M)] = min TVLiQ). (46) 

M>0,M-/=1 P>0,P-J=1 M>0,M-/=1 

Remark If we take function TZiiP) as follows. 

V (P)-l nP;M) p. 1=1; 

Then we can write in the following form. 

max[P•Q-7^l(P)] =7^1^(Q), (47) 

where 7^l(P) = n{P;M) + Oai(P), and the hyperplane Ai = {P : 
P-I = 1}. 

Applying the Lagrange Theorem, we obtain the following result. 
7^M(Q) = max [P•Q-7^M(P)] = minmax[P•Q-7^l(P)+A(P•/-l)l. 

p>o,P-I=l'- A P>0 \ / V /J 

(48) 

Since the function P ■ Q — TZi{P) + A(P • J — 1) is linear with 
respect to A and convex with respect to P, the min and max can be 
exchanged, i.e., 

minmax[P•g-7^l(P)+A(P•I-l)l = maxmin^P•Q-7^l(P)+A(P•/-l)l, 
\ P>0 \ / \ li A \ / \ 

(49) 

therefore we obtain the following result. 

n*M(Q)= max [P•Q-7^l(P)l =maxmin[P•g-7^l(P)+A(P•/-l)l. 

(50) 

Remark If considering this optimization problem, at first, with 
respect to A, that is, 

max[P • Q - 7^l (P) + Oa, (P)] = max[P • Q - 7^l (P) + min A(P • / - 1)1 , 
P>0 P>0 A 

(51) 
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we obtain the following result. 

7^^(Q) = max[P • Q - 7^l(P) + min A(P • / - 1)]. (52) 

To consider the optimization problem 

mm max[P • Q - 7^l (P) + A(P • / - 1)] , (53) 

applying the variational method, it suffices to consider the function 

c/ = p•g-7^l(p) + A(p•/-l), (54) 

or, equivalently, the function 

5;P,(Q, + ln-^ + A)-A, (55) 



and we obtain the result. 



SU = 6P,{Q, + In ^ + A - 1) + 5X{J2 ' 1)> (56) 
from which we obtain the following result. 

+ In ^ + A - 1 = 0. (57) 

Therefore, we reached following two results. 

P; = M^-exp(Q^ + A-l), (58) 

t^*m{Q) = p*Q- nM{P*) = -J2^^ ^^P(<3r^ + A - 1)(A - 1). (59) 

X 

Considering further P • I = 1, we obtain the following results. 

M^-expQ^ 

^m(Q) = P* ■Q-nM{P*) = In(^M^expQ^). (62) 

Remark It is easy to see that TZm{P*) = P* ■ Q ~ ^m(Q) is 
the classical relative entropy under the maximal utility, or classical 
relative capacity under given utility. 
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Since the set {M : M > 0,M ■ I = 1} is convex, the minimum 
of function TIm{P*) always exists with respect to M. Therefore, we 
reach the fohowing main result. 

Theorem The minimax value, associated with the above estimate 
game, defined by 

M>0,M-7=1 p>o,p.7=l 

makes sense, and so does its minimax strategy. 

Remark We can similarly define the maxmin value, associated 
with the above estimate game, by 

but it is needy to consider if this maxmin value and its maxmin strat- 
egy always exist, and further if this maxmin value is the same as the 
minimax value when this maxmin value always exists. 



4.3 Convex Conjugate View 

In the above, we applied the convex conjugate TZ*j^{Q) of classical 
relative entropy TIm{P) with respect to P, i.e., 

7^1,(Q) = ^^max^^{^' " Q - 7^m(P)}, (65) 

and obtained the following formula 

n*M{Q) = \n{J2 exp Q^), (66) 

X 

but starting from the result 7Vj^{Q) = ln(^^ cxp Qx) and applying 
the above biconjugate property, we can define the classical relative 
entropy as follows. 

Definition Classical relative entropy 7?.(/9; M) of p relative to M 
is defined as 

7^(p; M) = 7^M(p) = max{< p, Q > -TI*m{Q)}, (67) 

where TZlj{Q) = ln{J2x exp Q^). 

Obviously, the simple computation will give us the mathematical 
form of classical relative entropy. 

Proposition Classical relative entropy of {px} relative to {Mj,} 
is equal to Z)^[ps log(ps/Mj:)], that is, 

7^(p; M) = Y,[Px log(p./M,)], (68) 

X 
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which confirms the unique mathematical form of classical relative en- 
tropy, though still open to explain 7?.^(Q) = ln(^^ M,; exp Q-^) com- 
pletely in information theory. 



Many information measures for Quantum Signals, for example. Von 
Newmann Entropy [Wehrl 1978, OP 1993], Quantum Conditional En- 
tropy [HOW 2005], Quantum Relative Entropy [Umegaki 1962], Quan- 
tum Mutual Entropy [NC 2000], etc, upon information theoretic expla- 
nation and mathematical formula of Von Newmann Entropy. In fact, 
at present stage, no original information-theoretic definition, similar 
to shannon information entropy, is possible even for Von Newmann 
Entropy except for the mathematical formula. 

This section is for one new and general quantum relative entropy, in 
the sense without Radon-Nikodym (RN) derivatives. First we simply 
overview three types of quantum relative entropy, then investigate 
quantum estimate in the frame of game theory, (here following the 
terminology of classical estimate, the same terminology of estimate is 
still used, but in a different sense away from quantum physics), and 
reach one new mathematical form of quantum relative entropy, also 
give some important properties including monotonicity of quantum 
relative entropy. 

Throughout this paper we prefer to use in what is following the 
term "information" for negaentropy, leaving the term "entropy" for 
the opposite quantities like relative negainformation S{tu; (p) = —TZ{tu; (p), 
which coincides with usual von Newmann entropy S{w) if it is taken 
with respect to the trace (p = Tr. 

5.1 Historic Review 

There are several mathematical quantum relative entropies so far, 
mainly Araki-Umegaki type [Araki 1976], Belavkin-Staszewski type, 
Hammersley-Belavkin type [HB 2006]. This part overviews the basic 
definition of quantum relative entropy of Hammersley-Belavkin type 
and reduces to Araki-Umegaki type, Belavkin-Staszewski type. 

We define the Radon-Nikodym (RN) derivatives with respect to an 
arbitrary, not necessarily normalized, density 7 € At (i-e. a positive 
linear functional on A), 



5 Quantum Relative Entropy 



£•7 = 7 




(69) 



?7 = 7 



l/2^;^-l/2 



(70) 
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then we obtain quantum relative entropy of Hammersley-Belavkin 
type (known as 7 type elsewhere) as follows: 

Definition Quantum relative entropy of Hammersley-Belavkin type 
(known as 7 type elsewhere) to compound state w on the algebra 
A®B^ (or information divergence of the state vj with respect to a ref- 
erence state (/?) is defined by the density operator a;, ^ of these states 
vj and 1^ as 

n%{w- ^) = \{V^9{L-yjV^), (71) 

where L~^x = ^~^X is the operator of left multiplication by <^~^ iso- 
morphic to the operator <j~^ and L'^x = XQ is the right multi- 
plication by Q isomorphic to the operator Ia ® Q, g{r) is any strictly 
positive function g{r) at r > 1 with ^(1) = and operator-convex, A 
is a faithful, semi-finite trace. 

Note that this quantity is introduced in [HE 2006] in order to char- 
acterize the "distance" as an information divergence between states. 

Quantum relative entropy of Hammersley-Belavkin type (known as 
7 type elsewhere) includes the other two relative quantum information 
as special cases. 

Corollary If 7 = /, it gives quantum relative entropy of Araki- 
Umegaki type (known as a type elsewhere) to compound state w on 

the algebra A (E) B, (or information divergence of the state w with 
respect to a reference state ip) defined by the density operator a;, ^ of 
these states w and (f as 

7^W(ro;<^) = TV[u;(lnu; -ln(/))]. (72) 

This quantity is used in most definitions of quantum relative en- 
tropy. However unlike the classical case, this is not only possible choice 
for informational divergence of the states w and (/p, and it does not 
relate explicitly the informational divergence to the Radon-Nikodym 
(RN) density = 0~^/^a;(/)~^/^ of the state w with respect to as 
in the classical case. 

Corollary If 7 = cr, it gives quantum relative entropy of Belavkin- 
Staszewski type (known as h type elsewhere) introduced in [HB 2006] 
as 

7^('')(Tu;(/7) = Tr[ij\n{4)-^uj)], (73) 
where a; ln((/>~^c<;) = ln(a;0~^)a; is understood as the Hermitian oper- 

a;V2ln(a;V2<^-i^i/2)^i/2_ (74) 

This relative entropy can be explicitly written in terms of the RN 
density uj^p as 'R^^\'uci\^p) = (p{r{LU^)), where r{LU^) = a..'^ In c^'^ . 

Remark In finite dimensions and faithful states, the Bclavkin- 
Staszewski information divergence based on quantum relative entropy 



17 



of Belavkin-Staszewski type gives better distinction of uo and cp [OP 
1993] in the sense that it is greater than quantum relative entropy 
of Araki-Umegaki type, and that it satisfies the following important 
monotonicity property. 

Remark Quantum relative entropy of Hammersley-Belavkin type 
(known as 7 type elsewhere) is more generous quantum relative en- 
tropy, and includes Araki-Umegaki type (known as a type elsewhere) 
and Belavkin-Staszewski type (known as b type elsewhere), but can 
not exhaust all possibilities expect for commutative algebra, for ex- 
ample, the trace distance TZtriQ',^) = A(|^> — ^|) and the fidelity dis- 
tance 'R-fidiQ] ?) = l~^(l\/^\/^l) of Hammersley-Belavkin type 
(known as 7 type elsewhere) [HB 2006] . Undoubtedly, it is a challenge 
to obtain a general information-theoretic definition for quantum rel- 
ative entropy which reduces to a general mathematical formula for 
quantum relative entropy. 

For quantum relative entropy of the three types, it is easy to obtain 
the well-known monotonicity inequality. 

Theorem Given a normal completely positive unital map K : 
M ^ M'^, ii w = woK,ip = (pqK, then for both relative entropies, 

Tliw; if) < TZ{wo; (po). (75) 

Remark In fact, this monotonicity property is proved in [Lindblad 
1973, Holevo 1998a] for Araki-Umegaki type (known as a type else- 
where). And Belavkin-Staszewski type (known as b type elsewhere) 
and Hammersley-Belavkin type (known as 7 type elsewhere) satis- 
fies a system of properties [HB 2006] for quantum relative entropy 
including this monotonicity inequality. Therefore this monotonicity 
property holds for the three quantum relative entropies. 

5.2 Quantum Estimate 

In a quantum estimate, we are given quantum observations distributed 

according to an unknown distribution Pg & X associated with award Q 
(where X is a predual space of Hermitian IU*-continuous functionals 
on a l^*-algebra X* = A), which Nature chooses randomly from the 
set {Pff : 6 e Q} according to a known prior distribution /x on @, and 
we produce an estimate M for the unknown distribution Pg. In the 
end, we will suffer a relative entropy cost TZ{P;M), measuring the 
quality of this estimate, thus the whole utility is taken as P ■ Q — 
n{P; M). 

There are several mathematical quantum relative entropies, mainly 
Araki-Umegaki type, Belavkin-Staszewski type, Hammersley-Belavkin 
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type, and similarly three different matliematical forms of game tlie- 
oretic utilities. Below we concentrate on quantum relative entropy 
of Araki-Umegaki type, since other two types of quantum relative en- 
tropy are just mathematical complex in Radon-Nikodym (RN) deriva- 
tives but with the same intrinsic method. 

Applying the convex conjugate Ti*M ^{Q) of 7?.(P; M) with respect 
to P, that is, 

n*M,^{Q) = p max [P-Q- n{P- M)], (76) 

F>0,PI=IJ. 

the estimate problem is the following optimization problem. 

min max [P ■ Q - n(P; M)] = min 7^M„(Q), (77) 

D,M-7=1 P>0,P-7=/i V ' /J M>0,M-I=l -'^^'M^^^' ^ ' 



mm 

M>0,. 



where the cost function 71{P; M) here is taken as quantum relative 
entropy of Araki-Umegaki type. 

Remark If taking function Tli{P) as follows. 



n,iP) 



n{P;M) P-I = n; 
oo P-Iy^l. 



Then we can rewrite TI*m ^j^{Q) as follows. 

max[P•Q-7^l(P)]=7^;,,^(Q), (78) 

where 7^l(P) = n{P;M) + Oai{P), and the hyperplane Ai = {P : 
P-I = ^i}. 

Applying the Lagrange Theorem, we will find. 
7^*„(Q)= max [P•Q-7^l(P)] = minmax[P•Q-7^l(P)+A(P•/-//)]. 

(79) 

Since the function P Q — Tl\ (P) + A(P • / — /x) is linear with respect 
to A and convex with respect to P, therefore the min and max can be 
exchanged, that is, 

minmax[P•Q-7^l(P)+A(P•/-u)] = maxmin[P•Q-7^l(P)+A(P•/-u)], 
A P>0 \ / \ i~/J A 

(80) 

we can rewrite T^*m ^(Q) as follows. 

^M«(Q)= max ^P•Q-7^(P;M)] = maxmin[P•Q-7^l(P)+A(P•/-/x)]. 

(81) 

Remark Considering this optimization problem at first with re- 
spect to A, then the following formula is reached. 

T^*mAQ) = max[P•g-7^l(P)+minA(P•/-M)] = max[P•Q-7^(P; M)+Oa, (P)]. 

(82) 
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Applying the variational method to the function U = P ■ Q — 
Tli{P) + \{P ■ I — n), we obtain the following results. 

Q + lnM-lnP+(A-l)/ = 0, (83) 

TvP = n. (84) 

Simple calculations give us results as follows. 

P* = M ■ exp[Q + {X - 1)1], (85) 

T^liJQ) = P*-Q- T^mAP*) = -^W exp(Q + A - 1)(A - 1)]. (86) 
Considering further about two conditions P ■ I = ji and M • / = 1, 
we will find following three resTilts. 

_ M-expQ 
^ -^IV[MexpQ]' 

T^*mAQ) =P*Q- T^mAP*) = Mln[-Tr(MexpQ)]. (89) 

Remark It is easy to see that TZmAP*) = '^*m ^iQ) - P* ■ Q 
the quantum relative entropy under the maximal utility over P > 
and P I = n, or quantum relative capacity under given utility. It 
is one interesting problem to explain ^^^(Q) = /xln[^Tr(MexpQ)] 
completely in quantum information theory. 

Easily we can find the following properties of T^*m f^{Q)- 

Lemma The function TI*m ^iQ) = fJ- ln[iTr(M exp Q)] is monotonous 
with respect to M. 

Lemma The function 7^^^^(Q) = /Ltln[^Tr(MexpQ)] IS concave 
with respect to M. 

Since the set {M : M > 0, M • / = 1} is convex, there always exists 
the minimum of function 1Zm,^i{P*) with respect to M, and upon the 
above lemmas, we reach the following main result. 

Theorem The minimax value, associated with the above problem, 
defined by 

M>0,M-/=1 p>o,p.i=n 

makes sense, and so does its minimax strategy. 

Remark We can similarly define the maxmin value, associated 
with the above problem, by 

V= sup inf [P ■ Q - n(P; M)], (91) 

~ P>0,p.I=f,M>0,M-I=l' 

but it is a problem if this maxmin value and its maxmin strategy exist 
and if this maxmin value is equal to the minimax value. 
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5.3 Convex Conjugate View 

In the above analysis, we applied convex conjugate 71\^{Q) of quantum 
relative entropy TZm{P)-, that is, 

^M,M(Q) = p™ax {P•g-7^M(P)}, (92) 

P>0,PI=iJ. 

and obtained the following formula 

T^*mAQ) = /^ln[-TV(MexpQ)], (93) 
M 

but starting from the result ^{Q) = A*lii[^Tr(Mexp(5)], and ap- 
plies the biconjugate property of convex conjugate, we can define the 
quantum relative entropy Tl^{p\M) as follows. 

Definition Quantum relative entropy TZ^ip; M) of p relative to M 
is defined as 

7^^(p; M) = UmM = max{< p,Q > -7^1^,^(Q)}, (94) 

where TZl^ JQ) = ^ln[iTr(MexpQ)]. 
In particular, for the case p = 1, 

7^l(p; M) = 7^M,l(p) = max{< p,Q > -7^;^(Q)}, (95) 

where 7^^(Q) = ln[Tr(MexpQ)]. 

Obviously, the general mathematical form of this quantum relative 
entropy may not include the mathematical form of the Araki-Umegaki 
type, Belavkin-Staszcwski type, or Hammerslcy-Bclavkin type. 

Here we obtain one property for this quantum relative entropy. 

Theorem 

V/x>0,7^^(p;M) = /x7^l(ip;-M). (96) 

p p 

Proof: According to the definition of quantum relative entropy, 

n^{p;M) = nMAp) (97) 

= max{< p,Q> -pln[-TrMexpQ)]} (98) 
Q fJ' 

= pmax{< -p,Q > -ln[Tr(-Mexpg)]} (99) 

Q jjj jJL 

= pni{-p--M). (100) 
p p 
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5.4 Monotonicity of Quantum Relative Entropy 

To obtain its monotonicity of TZi{p; M) (or TZ{p; M) without notation 
confusion), we need several lemmas in our notation system. 

Let p, be positive trace class operators in a separable Hilbert 
space 7Y, r a trace-preserving from B{H) to a von Neumann subalgebra 
A. 

Lemma l([Kovacs 1966]) Let ^ be a finite von Neumann algebra. 
For each X ^ A and each inner *— automorphism a of A, there exists 
X e A such that ^ Y^n=Q ""(^) — »• X as A/" — > oo in the strong 
operator topology. 

Lemma 2 ([schwartz 1967]) Let A! denote the group of all unitary 
transformations in A, E{A!) the set of all non-negative real valued 
functions on A! which vanish expect at a finite number of points of A! 
and which satisfy 

Take f{X) = ^u^ji^i f{U)UXU* for each bounded operator X, then 
for each bounded operator X, there exists a sequence G E{A') such 
that fn{^) converges weakly to an element of A'- 

Lemma 3 Let X G T{H) be a trace class operators in a separable 
Hilbert space H, and K{X) the weakly closed convex hull of the set 

{UXU-^, unitaiyU e A'}. (102) 

Then 

K{x)nA = {r{x)}, (103) 

{T{Y)} = {T{X)} for all Y G K{X). 

Moreover, let E(A') be the set of nonnegative real functions on 
the set U{A') of unitary operators in A' which are nonzero only on a 
finite number of points and which satisfy ^/(U) = I. Take f{X) = 
^ f{U)UXU^^ , then there is a sequence {/„} C E[A!) such that 
UX) — T{X). 

Proof: r is normal, since the trace is normal, hence ultra-weakly 
continuous. 

If X G T(7i) be a trace class operators in a separable Hilbert space 
H, T(X) is the unique element of A such that 

Tr(r(X)y) =Tr(Xy), (104) 

for all Y e A, which implies that r{UXU-'^) = r{X) for all unitary 
U G A', hence T is ultra-weakly continuous r(X) = r{Y) for all 
Y G K{X). 
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The first statement results from Lemma 1; tlie last statement fol- 
lows from Lemma 2. 

Lemma 4 Tl{p\ Q is jointly convex in p and C,: If Aj > 0, ^ Aj = 1, 
then 

Proof: Since 

n^{p;C)^n^Ap) (106) 

= max{< p,Q> -p ln[-Tr(C exp Q)]}, (107) 
Q /J' 

'^CY m " max{< ^ AiPi, Q > - ln[Tr(^ A^Ci exp Q)]} 

(108) 

< max^Ai{< pi,Q > - ln[Tr(CiexpQ)]} (109) 
Q 

by additivity of inner product, < "^XiPijQ >= X^Aj < Pi,Q >, and 
convexity of logarithm function, 

= ^Xi max{< pi,Q>- ln[TV(0 exp Q)]} (110) 

by the positivity of for all 

= ^A^7^(p,;C^) (111) 

by the definition of Tl{pi]C,i). 

Lemma 5 Let P be a projection in H, and take pp = PpP, etc., 
then 

7^(/>p; Cp) + n{pi.p- C/-p) < 7^(p; C). (112) 
Proof: Note that U = 2P — / is unitary and that 

p' = pp + p,_P = ]^{p + U+pU). (113) 
Following Lemma 4, we have 

n{p'- C) < Inip; c) + Inu+pU; u^cu) = nip; c). (lU) 

Note that 

n{p'; CO > 7^(pp; Cp) + n{pi.p; Ci-P), (115) 
we reach the result 

n{pp; Cp) + 7^(p,_p; C/-p) < 7^(p; C). (116) 
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Lemma 6 Let P„ be a sequence of projections such that Pm < Pn 
for m < n, dimP„ is finite for all n, and P„ — ^ / strongly when 
n — > oo. Take /9„ = PnpPn, then the sequences {Tl{pn',Cn)} are 
monotonously increasing and 

n{pn;Cn)^n{p;0. (117) 

Proof: The monotonicity of {TZ{pn', Cn)} follows from Lemma 5. 
Recalling the facts, 

Tr(P„p2) _^ Trp2, (118) 

and 

< TtiPnipl-p^)] = Tr[Pnp{I-Pn)p] < TV[/(/-P„)] 0, (119) 
we have the result 

IV(p - p„)2 = 'IV(p2 - pI) = Tr[p\l - P„)] + Tr[Pnipl - p')] — > 0, 

(120) 

but — < Tr(p — p„)^, consequently ||p — p„|| — > 0, that is, the 
convergence p„ — ^ p is uniform. 

Note that TZ{p; C) is lower semicontinuous under the convergence 

{PnXn) ' ip;0, 

7^(p;C) <liminf7^(p„;Cn), (121) 
but from Lemma 5, we know that TZ{pn', Cn) < T^ip] C)) hence wc have 
liminf7^(p„;Cn)=7^(p;C)• (122) 
Lemma 7 Assume that {/fc} C £'(.4') satisfies 

/fe(p)— .r(p), (123) 

/fe(0^r(c) (124) 

weakly, then 

7^(^(p);^(c)) < \immfn{fk{pyjkiO) < ^(p;C)- (125) 

Proof: Since T is trace-preserving, use the spectral measure of r(p) 
(where positive operator p has the support projection /), there is a 
sequence of projections P„ in A satisfying the conditions of Lemma 6. 

Prom the definition of P, we have 

T{PnpPn) = PnTip)Pn. (126) 
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Since is bulit up of elements of A', we result that 

fkiPnpPn) = Pnnfk{p)]Pn. (127) 

In the finite dimensional space, Hn = PuH, fk{Pn) — ^ ^{Pn) is 
convergent uniformly, hence when k — > oo, 

7^(/fc(Pn);/fe(Cn)) ^^(r(p„);r(Cn)). (128) 

According to Lemma 6, we obtain that 

7^(p;C) =sup7^(p„;Cn), (129) 
hence Tl{p; C) is lower semicontinuous, that is 

7^(^(p);^(C)) < limmfTZ{fk{py,fk{0)- (130) 
Following Lemma 4, we have 

nfkip); fkiO) < Yl fk{u)n{upu+- ucu^) (i3i) 
= n{p-o (132) 

by the unitary invariance of TZ. 
Hence, 

7^(^(/>);^(c))<7^(p;C). (i33) 

Remark According to Lemma 7, it is hard to prove Monotonicity 
Theorem since we do not know if there is a sequence {f^} C E{A') 
which implements F on both p and 

Theorem (Monotonicity) For trace-preserving expectations F from 
B{7i) to a von Neumann subalgebra A, if p and C, are positive trace 
class operators in a separable Hilbert space TL, then 

7^(F(p);F(C))<7^(/9;C). (134) 

Proof: Choose a sequence of projections P„ G A, such that Pm < 
Pn for m < n, dimP„ is finite for all n, and P„ — > / strongly when 
n — > oo, and {fk} C E{A') such that fk{p) — *■ F(/9) weakly. Then 

fkiPn) r(p„) (135) 

in norm. 

For a given k, there exists gj C E{A') such that gjifkiC)] — ^ ^{0 
weakly when j — ^ oo, then 

9j[fk{Cn)]^r{Cn) (136) 



25 



in norm. 

If \\{fk — r)Pn|| < ^{k), choose gj^k such that 

\\{9j,kfk-mn)\\<eik). (137) 

Obviously 

\\i9,,kfk-T){pn)\\ = \\gj,kifk-r)iPn)\\ < Il(/fc-r)(p„)|| < eik). (138) 
Then = gj,kfk satisfies 

hkiPn) nPn), (139) 

hkiCn) r(Cn) (140) 

in norm. 

According to the proof of Lemma 7, we have 

7^(^(p„);^(c„)) <7^(p„;C„), (i4i) 

and further following Lemma 6, 

7^(^(p);^(c))<7^(p;C). (i42) 

Corollary Let be a set of mutually orthogonal projections in 
7i satisfying = / and the map F : p — > ^PkpPk is trace- 

preserving describing the interaction of a finite quantum system with 
a classical apparatus measuring an observable with eigen-spaces Pj-, 
then 

7^(^(p);^(c)) = ^n{PkpPk;PkCPk) < n{p-Q. (i43) 
6 Quantum Mutual Entropy 

Quantum mutual entropy was discussed in [CA 1997] via entropy 
diagram, which seems not starting originally from the information- 
theoretic sense. In the view of quantum control, quantum mutual 
entropy is extensively researched, starting from [BS 2005] and more re- 
cently [Belavkin 2001a, 2001b, BO 2001, BO 2002, SW 2006]. Belavkin 
and Ohya [BO 2001, 2002] introduced quantum mutual information as 
the von Neumann negaentropy Tl{w) = —S{w) of the entangled com- 
pound state related to negaentropy Tl{g'Si'i) = —S{g'Si^) of the prod- 
uct of marginal states, i.e. as the relative negaentropy 
-S^''\-cu;tp), in the sense of Lindblad, Araki and Umcgaki relative 
entropy [Lindblad 1973, Araki 1976, Umegaki 1962] with respect to 
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Naturally this approach treats quantum mutual entropy via entan- 
glement based on quantum relative entropy, therefore we can further 
define quantum mutual entropy via quantum entanglement. 

Definition We define the quantum mutual information X4^g(7r) = 
Ib,a{'^*) of both types in a compound state lo achieved by a quantum 
entanglement tt : H — ^ .4*, or by tt* : .4 — with 

g{A) = w{A ® /) = Tig[Ap],c;{B) = w{I ® = Trn[Ba] (144) 

as the quantum relative entropy of the state w on M = A<Si B with 
the respect to the product state (p = g^q: 

TA,B{T^)=n{u-p®a). (145) 

Theorem Let A : ;B ^ ^2 be an entanglement of the state 
a{B) = Tr[A(5)] to (^°,p°) with C £(^0), = on B, and 

vr = K*A be entanglement to the state p = p'^K on ^ C defined as 
the composition of A with the predual operator : A^ A^ nor- 
mal completely positive unital map K : ^ ^ AP , then the following 
monotonicity inequality holds 

^AB(vr) < ^^o,b(A). (146) 

Proof: This follows from the commutativity of the following dia- 
grams: 




Commutative diagram for en- Dual commutative diagram for 
tanglement tt entanglement tt* 

Applying the monotonicity property of our new quantum relative 
entropy on = A®B with respect to the predual map loq i-^ (K* 
Id)(a;o) corresponding to 1— > wq{K ® Id) as the ampliation K (g) Id 
of a normal completely positive unital map K: A^ AP. 

Definition The maximal quantum mutual entropy g(7rq) as the 
supremum 

HB{q)= sup XB,^(7r*) = Jee(7r*) (147) 

7r*(7)=(T 

over all entanglements tt* of any {A, g) to {B, <^) is achieved on = B, 
g*^ = ^ by the standard quantum entanglement 7r*(A) = a^^'^Aa^^'^ 
for a fixed <i{B) = TT-}i[Ba] is named as entangled, or true quantum 
entropy of each type of the state 
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Similar definition for Araki-Umegaki type can be found in [Belavkin 
2001a, 2001b; BO 2001, 2002], for Belavkin-Staszewski type in [BD 
2007]. 

Definition We call the positive difference 

HsiAW=HB{<^)-lA,Bi^) (148) 

entangled (or true quantum) conditional entropy respectively of each 
type on B with respect to A. 

Similar definition for Araki-Umegaki type can be found in [Belavkin 
2001a, 2001b, 2002; BO 2001], for Belavkin-Staszewski type in [BD 
2007]. 

7 Quantum Communication Channel 

Entanglement-assisted quantum channel capacity, or entangled quan- 
tum channel capacity is extensively researched, for example, entangled 
quantum capacity [Belavkin 2001a, 2001b; BO 2001, 2002] via a com- 
mon framework, entanglement-assisted quantum capacity [BSST 1999, 
2001] via entangled quantum mutual entropy. We further discuss en- 
tangled quantum channel capacity via quantum mutual entropy upon 
our quantum relative entropy and its additivity property. 

7.1 Quantum Channel Capacity 

Let B C C{Ti.) be the VF*-algebra of operators in a (not necessarily 
finite dimensional unitary) Hilbert space 7i. Generally we denote the 
set of states, i.e. positive unit trace operators in B{7i) by S{H), 
the set of all m-dimensional projections by VmCH) and the set of all 
projections by V{7i). 

Definition A quantum channel F is a normal unital completely 
positive linear map (UCP) of B into the same or another algebra 
B^ C B{7i^). These maps admit the Kraus decomposition, which is 
usually written in terms of the dual map F* : — ^ B^ as F*(ct'^) = 
Ylk^kCT^^l = r*(cr°) ([Stinespring 1955], [Lindblad 1973], [Holcvo 
1998a]), T{B) = Efc^fc-^^fe' ^r Ak are operators ^ H satisfying 

j:,aia, = i'. 

For example, quantum noiseless channel in the case B = C{TC), 
B^ = CiTi.^) is described by a single isometric operator Y : Ti.^ TL 
as T{B) = Y*BY. See for example [Holevo 1998a, Lindblad 1991] for 
the simple cases B = C{H), dim(?^) < oo. 

A noisy quantum channel sends input pure states ?o = ^0 on the 
algebra B^ = CiTi^) into mixed states described by the output densi- 
ties a = F*(c7°) on ^ C C{n) given by the predual F* = F* | B^ to the 
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normal completely positive unital map T : B ^ B which can always 
be written as 

r{B) = TrjF^ [Y^BY] . (149) 

Here y is a linear operator from (g) ^+ to H with Tr^^ [Y^^] = I, 
and ^+ is a separable Hilbert space of quantum noise in the channel. 
Each input mixed state is transmitted into an output state ? = (;^r 
given by the density operator 

r(^0) = y(a° ® I+)Y^ e B^ (150) 

for each density operator E B^, the identity operator /+ G T+. 

Wc follow [Belavkin 2001a, 2001b; BO 2001, 2002] to denote /Cg 
the set of all normal completely positive maps k : A ^ B^ with any 
probe algebra A, normalized as Ttk{I) = 1, and /Cq(?°) be the subset 
of K G /Cg with k(7) = a^. We take the standard entanglement tt^ 
on {B^,<;^) = {Ao,g'''), where Qo{Ao) = Tr[^oPo] given by the density 
operator pQ = cr^, and denote hy K a normal unital completely positive 

map A — > A^ = Ao that decomposes k. as k{A) = p^"^ K{A)p'q'^ . It 
defines an input entanglement k* = K^tt^ on the input of quantum 
channel as transpose-completely positive map on = ^ into A* 
normalized to p = K^:p^, p^ = pQ. 

The channel F transmits this input entanglement as a true quan- 
tum encoding into the output entanglement tt = K^TTgT = K^X map- 
ping B via the channel T into ^* with 7r(/) = p. The mutual entangled 
information, transmitted via the channel for quantum encoding k is 
therefore Ja,b{^^*'^) = JA,B{K*^q'^) = JaM^*^)^ where T = tt'^T is 
the standard input entanglement 71^(3) = a^^'^Ba^^'^ with ao = a^, 
transmitted via the channel F. 

Theorem Given a quantum channel T : B ^ B^, and an input 
state c;^ on B^, the entangled input-output quantum information via 
a channel T : B B^ achieves the maximal value 

J{^,r)= sup :7(k*F) = X^o,b(F), (151) 

where A = vr^F is given by the corresponding extremal input entan- 
glement 7r° mapping i3° = A^ into ^° = B^ with Tr[7rg(5)] = <jO(5) 
for all BeB°. 

Note that similar theorem for Araki-Umegaki type can be found in 
[Belavkin 2001a, 2001b; BO 2001, 2002], for Belavkin-Staszewski type 
in [BD 2007]. 
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The following definition depends on the commutativity of dia- 
grams: 




Commutative diagram for 
quantum channel T 
with standard entanglement 
TT^ = TT^ for A = B'^ 




Bl—^B. 

Dual commutative diagram for 

quantum channel T 

with standard entanglement 

ttO for ^0 = Bl 



Definition Given a quantum channel T : B ^ B^, and a input 
state on B^, we can define the input-output quantum entropy as 
the maximal quantum mutual information 

j(^°,r)=Xeo,B«r) (152) 

for input standard entanglement of the state to the state = 

Note that similar definition for Araki-Umcgaki type can be found 
in [Bclavkin 2001a, 2001b; BO 2001, 2002], for Belavkin-Staszewski 
type in [BD 2007]. 

7.2 Additivity of Quantum Channel Capacity 

Here and below for notational simplicity we implement the agreements 

Ai = Bl Ql = At = ®UBl q1 = such that = ^^iP? 

is transposed input state 5^ = (8)-Li^^ on B'^ = (8)-Li.4? with B^ = 

AO = ni — li _ „0 — _ ~i 

Let Fj be channels respectively from the algebra Bi on Hi to B^ 
on for z = 1, 2, n, and let = (8)"=iFj be their tensor product. 

We have the additivity property of this entangled input-output 
quantum entropy upon the monotonicity property. 

Theorem Let F® be product channel from the algebra B® = 
®F=i'Si to Aq = i^if^iAi, and let = <Si2=iQo be the tensor product 
of input states on Bq, then 

n 

J(^?,n = EWo,rO. (153) 

i=l 

Proof: Take F^, : B^ ^ Bi,, and g}^ G B^, q = Ti,{gi^) G Bi,, 

and Xi"^ : Af Ai"\ where A^ = ^Li^^E.' but ^1"^ is pred- 
ual to a general, not necessarily product algebra A^'^^ C C{Q^"'^). 
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For 7r(") = KrVo^r®, below we consider quantum mutual entropy 
X_4(„) ^^(vr^")) as quantum relative entropy 

7^((K^) ® rf)uf : Ki''\af)(^Tf{gf)), (154) 

where = ^"^j^Sq is the density operator of the standard compound 

state (8)f=itu^ with ti7^(A® A) = uj^{Ai^Bi) = Tr[Bi^Ai^] for 

Ai e B'^ , Bi e Bf , corresponding to (t° = pg- 

Applying monotonicity property of quantum relative entropy to 
the probe system (Q^^\A^'^^) for this given Qq and Fj, we obtain 

7^((K^) 0F?)So®;Ki")(a^)®Ff (;9^)) (155) 
< 7^((/d® F®)^^; /(i®(a®) ® Ff (p^)) (156) 

n 

= ^ 7^((/d ® Fi,)(So); ® ri,(p^)), (157) 

where = p° = p^, ph = <Ji = ^o- 

The suprema over is achieved on K^"'^ = Id^ identically 

mapping A^'"^ = <8)f=i.4'o to Bf^ = <8)"=iH^, where B^ = B^, coinciding 
with such due to Ai = B^. 

Thus J{Qt^^) = J:■=lJieO,^^)■ 

Definition Given a normal unital completely positive map F : 
B ^ A, the suprema 

Cq{r) = sup X^,b(/«*F) = sup J(?o, F) (158) 

is called the quantum channel capacity via entanglement, or q-capacity. 
Let F® be product channel from the algebra = -Li-Bi to Aq = 
The additivity problem for quantum channel capacity via 
entanglement is if it is true that 

n 

C,iT^) = ^C,iT,). (159) 

i=l 

In the spirit of [Belavkin 2001a, 2001b; BO 2001, 2002; BD 2007], 
we prove this additivity property upon the monotonicity property of 

our quantum relative entropy. 

Theorem Let F® be product channel from the algebra B^ = 
to A^ = then 

n 

C,{T^) = J2c,{ri). (160) 

i=l 
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Proof: It simply follows from the additivity (jl53p . Indeed, 

n 

C,{T^) = sup J^(„) g(KT®) = sup J(^?®,r®) = sup^ j(^>^o,r.) 

(161) 

Therefore by further taking suprema over Qq as over independently 
for each i = 1,2, ...,n, thus we have 



c,(r®) = J2^npj{Qi,r,) = J]c,(r,), (i62) 

i=l Qo 1=1 

which is the additivity property of entangled quantum channel capac- 
ity due to encodings via entanglement obviously. 

Remark Note that there is no such additivity for Holevo channel 
capacity for a arbitrary channel T : B ^ B^. Indeed, this capacity is 
defined as the supremum 

CdiT) = sup Ia,b{^*T) (163) 

over the smaller class ICd C /Cg of the diagonal (semiclassical) encod- 
ings K : A ^ B^ corresponding to the Abelian algebra A. This supre- 
mum cannot in general be achieved on the standard entanglement of 
A^ = B^ = Bq if A^ is non Abelian corresponding to the non Abelian 
input algebra B^. Therefore the supremum Cd(r®) < Yli=i^d(^i) can 
be achieved not on a product Abelian algebra A^"^^ as is was in the true 
quantum case where we could take = (^f^iB^ with non Abelian 
Bl^ = Bl 



8 Conclusion 

This work so far is just a game theoretic-application to quantum in- 
formation. 

In the introduction to strategic game, we obtained a sufficient con- 
dition for minimax theorem, but it is still worth to explore the neces- 
sary condition of minimax theorem; 

In the exploration of classical/quantum estimate, we found the 
existence of the minimax value of the game and its minimax strategy, 
but it is still interesting to discuss the existence of maximin value 
and its maximin strategy. One further problem is to give the general 
bounds on those values; 

In the view of convex conjugate, we arrived at one approach to 
quantum relative entropy, quantum mutual entropy, and quantum 
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channel capacity and discussed the monotonicity of quantum rela- 
tive entropy and the additivity of quantum communication channel 
capacity, and it is still worth to explore other properties and their 
bounds. 
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